Using Data Mining to Construct an Intelligent Web Search System

نویسندگان

  • Yu-Ru Chen
  • Ming-Chuan Hung
  • Don-Lin Yang
چکیده

In this paper, we present a new ranking algorithm and an intelligent Web search system using data mining techniques to search and analyze Web documents in a more flexible and effective way. Our method takes advantage of the characteristics of Web documents to extract, find, and rank data in a more meaningful manner. We utilize hyperlink structures with Web document content to intelligently rank the retrieved results. It can solve ranking problems of existing algorithms for multiframe Web documents and unrelated linked documents. In addition, we use domain specific ontologies to improve our query process and to rank retrieved Web documents with better semantic notion. Furthermore, we use association rule mining to find the patterns of maximal keyword sets, which represent the main characteristics of the retrieved documents. For subsequent queries, these keywords become recommended sets of query terms for users’ specific needs. Clustering is used to group retrieved documents into distinct sets that can help users make their decisions easier and faster. Experimental results show that our Web search system is indeed effective and efficient.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology

Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...

متن کامل

Intelligent Health Solution System

Introduction: In the field of management, the statistics and performance of the deputies and functions of the organization are always of great importance, which requires instant access to the latest status of the system under coverage and minimal forecast of the future situation, to provide quality services Also improve. All of this justifies the existence of an intelligent statistical system w...

متن کامل

ARTICLE IN PRESS Complementing search engines with online web mining agents

While search engines have become the major decision support tools for the Internet, there is a growing disparity between the image of the World Wide Web stored in search engine repositories and the actual dynamic, distributed nature of Web data. We propose to attack this problem using an adaptive population of intelligent agents mining the Web online at query time. We discuss the benefits and s...

متن کامل

Complementing search engines with online web mining agents

There is a mismatch between the static image of the World Wide Web stored in search engine repositories and the actual dynamic, distributed nature of Web data. We propose to attack this problem using an adaptive population of intelligent agents mining the Web online at query time. We discuss the bene ts and shortcomings of using dynamic search strategies versus the traditional static methods in...

متن کامل

A Technique for Improving Web Mining using Enhanced Genetic Algorithm

World Wide Web is growing at a very fast pace and makes a lot of information available to the public. Search engines used conventional methods to retrieve information on the Web; however, the search results of these engines are still able to be refined and their accuracy is not high enough. One of the methods for web mining is evolutionary algorithms which search according to the user interests...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Int. J. Comput. Proc. Oriental Lang.

دوره 16  شماره 

صفحات  -

تاریخ انتشار 2003